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imental  situations  naturally  lead  to  observations  classified  into  certain 
groups.  The  process  of  recording  or  storing  observations  directly  leads 
to  grouped  data.  Other  examples  of  circumstances  which  lead  to  grouped 
data  have  recently  been  discussed  by  Haitovsky  (1973)  in  his  mongraph  on 
regression  estimation  from  grouped  observations.  From  a practical  point 
of  view,  the  case  for  grouped  data  has  been  advocated  by  Durbin  (195M  as 
a technique  for  minimizing  errors  of  measurements.  Grouped  observations 
also  result  in  experiments  where  precise  measuring  instruments  are  not 
available. 

There  is  an  extensive  literature  on  the  study  of  grouped  data.  The 
estimates  of  probability  density  function  as  a histogram,  by  the  very  nature 
of  the  problem  require  grouped  observations.  Standard  tests  of  goodness 
of  fit  require  grouping  of  data  for  performing  tests  such  as  those  using  the 
Chi  squared  statistic.  The  effect  of  grouping  in  regression  problems  has 
been  studied  by  many  authors  and  Haitovsky  has  provided  a large  list  of 
references.  The  computational  problems  involved  in  obtaining  exact  solutions 
become  difficult.  In  this  paper,  approximate  estimates  of  the  parameters 

in  the  regression  models  have  been  proposed.  Certain  optimal  properties  of 
these  estimates  are  discussed  end  a comparison  of  these  estimates  with  other 
competing  ones  is  given.  Simulation  experiments  show  that  the  approximate 
estimates  based  on  grouped  data  compare  well  with  those  based  on  ungrouped 
data  under  certain  cases. 
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2.  MODEL  AND  NOTATION 


We  consider  K populations  generated  by  the  random  variables  Y^,  Y?) 
Suppose  the  possible  values  of  the  randan  variables  occur  in  the  disjoint 
intervals  (c  , c ) (c  c )...(c  e ).  and  only  the  frequencies  of  the 

O 1 It  1-11 

randan  variables  are  known  in  these  intervals.  We  use  the  following  notation: 
(a)  = number  of  Y^'s  in  the 

interval  (c^j  e^, 

i - 1,  2, , . , ,1,  k - 1,  2, . . . ,K.  (2.1) 


(b)  be  the  standard  normal  density 

-m  • T a 

*00  “ | ^e'  dt. 
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Let  be  normally  distributed  with  mean  and  variance  a with 

(c)  nk  =Bl  V*  S2  *k2  +-"+  Pp  *1®  ' 
k « 1,  2,...,K. 

<d>  q* 


ci-^ 


<•>  v 


(2.2) 

(2.3) 


(8.4) 

(2.5) 

(2.6) 


The  likelihood  of  the  sample  of  size  n^,  n,,,...,!^  from  the  K populations, 
is  the  product  of  multinomials,  given  by 


-TT  (r  'f  ' 

k-1  VBlk'N2k” 


N 


TT. 


Ik 


Ik’ 


lk 


(2.7) 


with 


"k  * ”lk  + N2k  +"'+  HIk 


K 

n W "k 

k**l 


and 


2V' 


e - (3X  e2  ...  pp  a ) 
is  the  (p+1)  dimensional  vector  of  parameters. 


(2.8) 


The  likelihood  equations  for  estimating  £ are  obtained  by  differentiating 

2 

(2.7)  with  respect  to  3^  3g,...,3  and  o . We  then  have,  using  (2.6) 


N41[[4(Clk)- *CCi_i  k)] 

1 k •«!»)- k>  ‘ 

i =»  1,  2, . . . ,p 

and  , , 

liiikllliW  . o 

i k *(tlk)  - *CCi.1  k) 


(2.9) 


(2.10) 


The  form  of  the  above  likelihood  equations  makes  the  estimation  of  the 

parameters  an  involved  problem.  In  the  case,  p-1,  Gjeddeback  (19^9)  has  derived 

the  likelihood  equations  (2.9)  and  (2.10).  By  introducing  certain  tabulated 

functions,  he  was  able  to  solve  the  equations  in  case  p=l.  Later  Kulldorff 

(1961)  gave  conditions  under  which  the  maximum  likelihood  estimates  exist 

and  are  unique  roots  of  the  likelihood  equations  for  p-1,  for  an  estimate  of 

2 

3^  as  well  as  for  an  estimate  of  a separately.  Simultaneous  estimation  of 
2 

3^  and  a was  not  attempted. 

Using  approximations  to  TT^,  an  alternative  approach  is  given  here  to 
find  maximum  likelihood  estimates  of  the  parameters  in  the  case  of  the  multiple 
regression  model.  Several  questions  concerning  the  uniqueness  of  solutions 
and  the  convergence  of  the  solutions  obtained  by  iterative  procedures  are 
still  under  study.  The  desirability  of  using  approximate  estimates  arises 
from  the  fact  that  they  have  certain  nice  properties,  some  of  which  are 
discussed  here. 
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3.  APPROXIMATE  MAXIMUM  LIKELIHOOD  ESTIMATES 


The  general  approach  here  ia  to  assume  that  certain  terms  in  the 
expansion  of  the  probabilities  TT^  are  negligible.  For  this  purpose,  we 
assume  that  the  intervals  (c^  c^),  i = 1,  2,..,, I are  of  the  same  length 

h.  That  is 


Cj^  - m h,  1 = 1,  2, . . . ,1  (3*1) 

We  assume  that  the  numbers  of  intervals,  I,  ia  infinite.  Let  be  the 
middle  point  of  the  interval  (c^  c^),  such  that  the  random  variable 

M may  be  defined  with 

P(M  - - TTik,  (3.2) 

and  let  ■ (m^  - u^)/o,  i = 1,  2,..., I.  Note  that  M * if  and  (3.3) 
only  if  Cj  ^ < y < . We  have, 


V - *«ik  * - «?ik  - t 


'lk  * Sc  • (z  - ?.„)r 


J 


* h_  r=0 
Hk  ' 2 a 


r: 


dz 


5 ♦<?!*>  £ 


fhv2r 

■gf 2 H (?ik} 

t r\— . -a  \ • 2r 


r-0  2 *(2rfl)J 


(3.^ 


where  Hgr^lk^  is  a polynomial  of  order  2r,  The  detailed  study  of 
Hermits  polynomials  has  been  given  by  Kendall  and  Stuart  (1969  p.  155-156). 
Using  Taylor's  expansion,  we  have 


if 


log  nlk  . log  £ - log  jar  - 1 c2lk 


- -£>  <4  -» 


2k  a 


+ — S-  ^ik + Ugik  -2)  + 0 


2880a" 


Estimation  of 


The  likelihood  equations  are 
9 log  TT1V 

1 1 "ii  -5 ^ ' °-  t’1-s p' 


Using  (3.5),  the  equation  (3.6)  reduces  to 

k i <_  ^ ^ "5ik + i?  5i“ 

■ ^7  ^ + °'(7>  ■ ° 

Substituting  expressions  for  5 from  (3.3)  and  from  (2.4),  we 

IK  K 

simplification,  the  likelihood  equations  (3.7)  reduce  to 
S1  k J "ik  Tklxkl  * 62  J £ "ik  xkl*k2  +‘ • • 

* »p  “ "ik  *k«*k2 

■“  "u  Vki  * °i  <-*>• 
i k a 


(3.5) 


(3.6) 


(3.7) 

have  on 


(3.8) 
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Ignoring  terms  0^(— jj-) , the  likelihood  equations  are  standard  equations  in 
a 

the  form 


A . . -1 

6c  * (n't)  */  R. 


(3-9) 


where  jj  is  the  vector  of  mid  points  of  the  intervals  and  x^  is  the  matrix 

of  independent  variables  x^.  x^  is  assumed  to  be  of  full  rank. 

2 

Estimation  of  0 with  the  above  approximation  is  obtained  by  the 
equation 


£ I N 

i k 


ik 


12  o 


(-n  n . . 

n i k ik 


*ik 


•1) 


(3.10) 


Let  be  estimated  by 


(3- 11'' 


and  let  us  denote  by 


8°2  = " i k 


£ N 


ik 


(m. 


(3.12> 


The  equation  (3-10)  reduces  to 


1 - ¥ + -72  <2  ¥ - d - ° 

o 12a  a 


(3.13) 


leading  to  an  estimate  (within  0(— ^)) 


6 


• - *02  [>  ' ^ 


(3.X4) 


There  are  other  methods  of  obtaining  approximate  eatlmates  of 

2 

and  o , given  by  Tallis  (1967)  and  fryer  and  Pathybridge  (1972).  These 
author*  have  uaed  iterative  procedures,  arriving  at  the  same  results  as 
(3.9)  and  (3.I1*).  In  the  calculations  of  these  estimates  by  the  above 
method,  iteration  is  avoided  and  consequently,  there  is  considerable 
amount  of  saving. 

hh 

With  the  above  approximation,  that  is,  within  the  terms  of  O(^), 

a 

the  estimates  correspond  to  ordinary  least  squares  estimates.  Such 
estimates  are  known  to  have  many  desirable  properties. 

We  also  note  that  to  the  order  of  approximation  implied  in  Shappard 
corrections, 

*<i,>  * B.  <3-«l 

and  Cov  (f^)  • (x,**^-1  c2  (l  + -^g)  . (3.16) 

12  a 

Sheppard  corrections  also  lead  to  the  estimates  (3*16)  when  it  is  asstned 
that  h is  small.  Here  we  are  assuming  that  h/o  is  small.  The  estimates 
obtained  by  the  above  method  eliminate  the  need  of  iteration  and  can  be 
computed  by  available  computer  programs. 

A 

The  loss  of  efficiency  in  estimating  g,  by  the  mid  point  estimator  6^ 

as  compared  to  the  regular  ungrouped  estimator  ^ is  approximately 
1 h2 

■jtj  * -£■.  By  a proper  choice  of  h,  therefore,  the  efficiency  loss  can 

a 

be  made  as  small  as  one  would  like.  In  mortality  studies,  where  grouped 
data  are  usually  obtained,  the  loss  of  efficiency  can  be  compared  with 
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the  coat  of  experiment.  for  ioh  further  interesting  results,  tha 
reeder  uy  refer  to  Thempson  (1977K 


4.  SIMULATION  RESULTS 

Comparison  of  approximate  estimates  with  the  actual  estimates  is 
not  easy  since  the  actual  estimates  under  grouping  require  tedious  com- 
putations. We  shall  compare  here  the  estimates  for  certain  simulated 
experiments  under  the  assumptions  of  ungrouped  and  grouped  observations. 
Two  experiments  ware  performed.  The  first  experiment  is  concerned 

with  the  following  model.  It  is  assumed  that  y are  normally  distributed 

2 

with  the  same  variance  a , and 
E(ylk)  - • * 6 ^ 

h 39  1 , 2^ . . . yK.  i * 1 , 2) . . . . 

Only  a selected  values  of  the  parameters  were  considered.  Normal  random 
deviates  were  chosen  for  each  possible  combination  of  parameters  and  100 
replications  of  the  samples  were  obtained.  The  following  parameter  values 
were  selected  as  given  in  Table  1. 


Table  1 


ao 

al 

2 

0 

(i) 

10 

0 

25 

(ii) 

40 

0 

25 

(iii] 

10 

1 

100 

(iv) 

40 

0 

100 

for  h ■ 0,  2,  ?,  4,  5,  10,  15,  20 


Also  the  number  of  observation*  chosen  were  ■ 10  and  100  and  K a iq. 


*1  * °»  X2 


1,...,x^q  ■ 9.  The  results  are  given  in  tables  2-5- 


The  second  model  considered  is  the  following. 

ylk  - ' 

i * k * 1,  2,...,K  • 

We  consider  the  cases  for 


P1  ■ o> 


&2  “ 0, 


f33  * 0, 

2 

c * 10,000  and 
Y - 10,  25,  300, 

K a 4. 

The  3xn  matrix  Is  chosen  to  be 

^fo..  .022.  ..277.. 


X.' 


[1  1..  .144.  ..455 

33.  ..  3 66.  ..633 

V 


. 7 9 9 ...  9 
.511.  • • 1 
.388.  . ,8 


h * 0,  10,  50,  100,  200. 


Tables  6-9  provide  the  estimates  of  the  parameters  ^ and  0 . The  ungrouped 
observations  are  obtained  when  h * 0.  The  tables  show  clearly  that  mid 
point  estimates  correspond  very  well  with  the  ungrouped  estimates. 


5.  APPLICATION 

The  theory  developed  above  has  been  applied  to  a study  recently 
completed  by  Student  Health  Services  of  the  Ohio  State  University.  The 
data  were  collected  in  1975  to  study  blood  pressure  levels  and  their  cor- 
relates for  the  patients  visiting  the  Student  Health  Services.  In  addition 
to  Systolic  and  Diastolic  blood  pressures,  other  physiological  variables 


k 
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Table  2:  Mean  and  S.h.  of  ungrouped  and  mid-point 

esimates  of  a,  obtained  in  the  simulation- 
mode 1 ( i ) , n 1 - 1 0 


h 

Mean 

S.  U. 

■ 0^=25 

0 

9.9565 

1.0006 

? 

9.9528 

1.0120 

3 

9.9691 

1.0401 

4 

9.9388 

1 . 0094 

5 

9.9344 

1.0454 

10 

9.9705 

1.1260 

15 

9.4664 

1.2471 

20 

9.9618 

0.7274 

n 7.  — T c 

U 4 D 

0 

39.9387 

0.9144 

2 

39.9409 

0.9330 

3 

39.9563 

0.9238 

4 

39.9081 

0.9475 

5 

39.8800 

0.9328 

10 

39.8169 

0.9069 

15 

39.5888 

1.3240 

20 

39.6673 

1.6212 

. _ /-t  2 inn 

0 

10.1800 

1.9060 

2 

10.1743 

1.9352 

3 

10.1605 

1.9118 

4 

10.2079 

1. 9485 

5 

10.1628 

1 .9267 

10 

10.1169 

2.0492 

15 

10.1877 

2.0107 

20 

10.2509 

2.3219 

ri  2 - 1 r>  A 

— <J  - 1 u u 

0 

40.1767 

2.0242 

2 

40.1545 

2.0245 

3 

40.1892 

2.0221 

4 

40 .1753 

2.0289 

5 

40.1889 

1.9825 

10 

40. 2031 

2.0785 

15 

40.1250 

2.2190 

20 

40.3545 

2.3866 

(S.h.  )7(l*h2/l2o2n 


a = 1 0 


a = 40 


1.0006 
1 .0073 
1.0155 
1.0270 
1.0415 
1 .1554 
1.3237 
1 . 5285 


0.9144 

0.9204 

0.9280 

0.9384 

0.9517 

1.0558 

1.2150 

1.3967 


1 . 9060 
1.9092 
1.9131 
1.9187 
1.9258 
1.9838 
2.0770 
2.2009 


2 .0242 
2.0276 
2.0318 
2.0377 
2.0452 
2.1069 
2.2059 
2.3374 


(S.E.)  for  ungrouped  data,  i.e.,  for  h=0 


Table  3: 


Mean  and  S.F..  of  ungroupcd  and  mid-point  estimates 
of  6,  obtained  in  the  simulation-model  (i),  nj^lO 


h 

Mean 

S . I: . 

(s.n 

.)hl+h2/(12o2)) 

2 - 7 c 

rv  = 1 n - - 

O e J 

CX  I w 

0 

0.0063 

0.1951 

0. 1951 

2 

0.0061 

0.1971 

0.1964 

3 

0 . 0037 

0.1994 

0.1980 

4 

0.0100 

0.1942 

0.2002 

5 

0.0034 

0.2072 

0.2030 

10 

-0.0015 

0.2229 

0.2253 

15 

-0.0056 

0.2453 

0.2581 

20 

0.0018 

0.1506 

0.2980 

n 7 ~ i c 

Ct  H u 

0 

0.0124 

0.1532 

0.1532 

2 

0.0125 

0.1598 

0.1542 

3 

0.0109 

0.1553 

0.1555 

4 

0.0150 

0.1631 

0.1572 

5 

0.0193 

0.1608 

0.1595 

10 

0.0289 

0.1591 

0.1769 

15 

-0.0082 

0.2247 

0.2027 

20 

0.0579 

0.2941 

0.2340 

n 2 - 1 n n 

I n 

0 

-0.0225 

0.3391 

0.3391 

2 

-0.0216 

0.3440 

0.3397 

3 

-0.0183 

0.3397 

0.3404 

4 

-0.0265 

0.3434 

0.3414 

5 

-0.0205 

0.3413 

0.3426 

10 

-0.0204 

0.3613 

0.3530 

15 

-0.0233 

0.3521 

0.3695 

20 

-0.0331 

0.4005 

0.3916 

„ 2- i nn 

™ - a n - - - 

U 1 uu 

0 

-0.0266 

0.3680 

0.3680 

2 

-0.0231 

0.3678 

0.3686 

3 

-0.0290 

0.3719 

0.3694 

4 

-0.0289 

0.3681 

0.3704 

5 

-0.0267 

0.3622 

0.3718 

10 

-0.0289 

0.3744 

0.3830 

15 

-0.0010 

0.3991 

0.4010 

20 

-0.0614 

0.4237 

0.4249 

(S.II.)  for  ungrouped  data,  i,e.,  for  h-0 
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Table  4: 


Mean  and  S . T: , of  ungrouped  and  mid-point 
estimates  of  a,  obtained  in  the  simulation- 
model  ( i ) , n^= 100 


h 

Me  a n 

S.h.  (S.l 

i.J  V(l+h2/(12u2)) 

_ ,-f  2-7  Q 

- 1 H _ 

0 

6.9964 

0.3066 

0.3066 

2 

9.9933 

0.3047 

0.3086 

3 

10.0006 

0.3088 

0.3112 

4 

9.9916 

0.3068 

0.3147 

5 

9.9885 

0.3258 

0.3191 

10 

10.0075 

0.3706 

0.3540 

15 

9.5020 

0.3555 

0.4056 

20 

9.9869 

0.2672 

0.4683 

^ _ _ 

O u ») 

0 

40.0149 

0.2892 

0.2892 

40.0157 

0.2916 

0.2911 

3 

40.0187 

0.2934 

0.2935 

4 

40.0156 

0.2846 

0.2968 

5 

40.0263 

0.3044 

0.3010 

10 

40.0323 

0.3227 

0.3339 

15 

39.6091 

0. 3658 

0.3826 

20 

40.0225 

0.5825 

0.4418 

2 - 1 n n 

Ct  lu 

0 

9.9118 

0.6253 

0.6253 

2 

9.9156 

U . 6259 

0.0263 

3 

9.9168 

0.6239 

0.6276 

4 

9.9184 

0.6359 

0.6294 

5 

9.9153 

0.6302 

0.6318 

10 

9.9069 

0.6501 

0.6508 

15 

9.9125 

0.7256 

0.6814 

20 

9.9275 

0.6911 

0.7220 

_ nn 

™ - a n __  __  „ „ - 

0 

39.9574 

0.5485 

0.5485 

2 

39.9616 

0.5509 

0.5494 

3 

39.9559 

0.5485 

0.5505 

4 

39.9684 

0.5648 

0.5521 

5 

39.9627 

0.5649 

0.5541 

10 

39.9619 

0.5850 

0.5709 

15 

39. 9344 

0.6133 

0.5977 

20 

39.9755 

0.6747 

0. 6533 

1 cs.n.) 

for  ungrouped  data,  i.e.,  for 

h=0 

Table  5: 


Mean  and  .8.1:.  of  ungrouped  and  mid -point 
estimates  of  B,  obtained  in  the  simulation- 
model  ( i ) , n^  = 1 00 


ll 

Mean 

s.i:.  (s.i-:. 

. )V(l  + h2/(12o2)) 

„ 7 - ? c 

- i n - 

0 

-0 . 0027 

0.0530 

0.0530 

2 

-0 . 0019 

0.0530 

0.0533 

3 

-0 . 0038 

0.0537 

0.0538 

4 

-0.0012 

0.0  54  0 

0.0544 

5 

-0.0019 

0.0573 

0. 0551 

10 

-0.0036 

0.0644 

0.0612 

15 

0.0024 

0.0660 

0.0701 

20 

0 . 0006 

0.0484 

0 . 0809 

rT  2 - 9 c 

rj  ~ a r 

0 Ct  j 

Ct  - 4 u — — 

0 

-0.0034 

0.0563 

0.0563 

2 

-0.0038 

0.0567 

0.0567 

3 

-0.0043 

0.0559 

0.0591 

4 

-0.0048 

0.0554 

0.0578 

5 

-0.0044 

0.0598 

0.0586 

10 

-0.0052 

0.0619 

0 . 0650 

15 

-0.0122 

0.0703 

0.0748 

20 

-0.0042 

0.1082 

0.8598 

„ 7 inn 

.-in 

O 1 uu 

0 

0.0183 

0.1241 

0.1241 

2 

0.0177 

0.12  37 

0.1243 

3 

0.  0186 

0.1225 

0.1245 

4 

0.0172 

0.1249 

0.1249 

5 

0. 0178 

0.  1256 

0.1253 

10 

0.0169 

0 . 1309 

0.1291 

15 

0.0177 

0.1385 

0.1352 

20 

0.0114 

0. 134  7 

0.1432 

n 2 - l nn 

0 

-0.0013 

0.  1058 

0.1058 

2 

-0. 0018 

0.  1063 

0.1060 

3 

-0.0001 

0.  1046 

0.  1062 

4 

-0.0034 

0. 1080 

0.1065 

5 

-0.0026 

0.1082 

0.  1069 

10 

-0. 0009 

0.1122 

0.1101 

15 

0. 0019 

0.1162 

0.1153 

20 

0.0023 

0.1275 

0.1221 

J cs.i- 

. ) for  ungrouped  data,  i.e. 

, for  h=0 
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Table  61  Mean  and  S.li.of  ung  rouped  and  mid-point  estimates 
of  and  83— model  (ii),  o=I00 


1 

ii 

M e a n 

1 •; 

j i 

1 

3l 

b2 

h 

p 

T*  , - T fi  -v-  oil  V 

j 

1 

ni  i u r or  all  k 

‘ | 

1 

1 

0 

0. 7193 

- 1 .5555 

0.0191 

i 

10 

0.7110 

-1.4928 

-0.0219 

t 

50 

0. 6257 

-1.2162 

-0.1175 

i 

100 

0. 6832 

-0.7006 

-0. 5402 

1 

200 

0. 2899 

-0.8026 

-0. 2323 

•1 

-* 

l 

\ 

ill^  L J 1 U 1 dll  N 

4 

l 

l 

0 

-0.3682 

2 . 2 3 S S 

-0. 9839 

>. 

1 

10 

-0.3674 

2.2426 

-0.9911 

3 

50 

- 0.3822 

2.4762 

-1.1478 

i 

r 

100 

-0.3151 

2.4754 

-1 . 2884 

i 

i 

200 

-0.2097 

1.5684 

-0.7932 

ily,’"'iUU  IOT  dll  K 

0 

0.1319 

-0.2491 

0.0450 

i 

10 

0.1283 

-0.2353 

0.0370 

\ 

1 

50 

0.1679 

-0.2946 

0.0333 

. 

s 

100 

0.2409 

-0.3710 

0.0321 

' 

i 

200 

0.2629 

-0.4829 

0.0933 

I 


Table  7". 


Covariance  matrix  of  the  ungrouped  and  the  mid 
point  estimates  of  3 and  the  values  of 
(x'x)  - 192(1  + 1^/(12  52) ) in  parentheses  for 
comparison-model  (ii),  nk=10,  o=100. 


A A 

3 1 B? 


Ungrouped  data 


Bl 

60.84 

-183. 59 

57  . 

, 2 9 

^2 

-183. 59 

803. 98 

- 326. 

, 87 

^3 

57.29 

-326.87 

156. 

. 92 

Croups  of  length  h=10 


f 

I 


60.67(60.89)  - 1 83 . 1 6 ( - 1 83 . 7 S)  57.16  (57.3  4) 

183. 1 6 ( - 1 8 3 . 75)  802.72  (804.65)  - 32 6 . 30  ( - 3 2 7 . 14 ) 

57.16(57.34)  - 3 2 6 . 30  ( - 3 2 7 . 1 4 ) 1 56.57  (157.05) 

Groups  of  length  h=50  

62.49(62.10)  -187. 20 (-187. 39)  57.50(58.48) 

18  7 . 20  (-  187. 39)  8 2 3.  52  (820 . 63)  - 333 . 8 6 ( - 333 . 63) 

5 7 . 50  (58.48)  - 333 . 86 ( - 3 3 3 , 63)  160.98  (160.17) 

Groups  of  length  h=100  


64.30(65.88)  -191.02(-198.80)  58.45(62.04) 

191 . 02  (-198. 80)  842.92  ( 870.57)  - 34 1 . 8 3 ( - 3 53 . 94 ) 

58.45(62.04)  -341. 83 (-353. 94)  165.69(169.91) 

Groups  of  length  h=200  

77.20(81.00)  -233. 75 (-244 . 42)  70.13(76.27) 

233. 7 5 ( - 244 . 4 2)  10  61.75(1070.  33)  - 4 3 5 . 38  ( - 4 3 5 . 1 5) 

70.13(76.27)  -435. 38(-435.  15)  21  7..  81  (208 . 90) 


Table  8: 


Covariance  matrix  of  the  ungrouped  and  the 
mid-point  estimates  of  g and  the  values  of 
(x'x) o 2(1+ h 2/(i 2a 2))  in  parentheses  for 
comparison-model  (ii),  np=25,  o=100 


Pi 


Pi 

17. 

82 

A 

e2 

-57. 

17 

A 

B3 

20. 

72 

A 

6l 

17. 

69(17. 

83) 

62 

- 56 

81 ( - 57 

. 22) 

A 

63 

20 

68(20. 

74) 

A 

Bl 

17 

12(18. 

19) 

B2 

- 55 

23 ( - 58 

. 36) 

A 

B3 

20 

17(21. 

15) 

A 

Bl 

18 

03(19. 

30) 

b2 

- 56 

92 ( - 61 

.93) 

B3 

20 

.55(22. 

45) 

Bl 

21 

.87(23. 

76) 

b2 

- 63 

6 5 ( - 7 6 

.23) 

A 

B3 

21 

94  ( 27  . 

63) 

82  63 


Ungrouped  data 


-57.17 

20.72 

257.42 

-111.59 

-111.59 

55.42 

Croups  of  length 

h=  10  

- 56.81  (-  57.22) 

20.68(20.74) 

256.56(257.63) 

-111. 53 ( - 1 1 1 . 68) 

-111 . 53 ( - 1 11 . 68) 

55.42(55.47) 

Groups  of  length 

h=  50  - 

- 55 . 23  (-  58. 36) 

20.17(21 .15) 

255.82(262.78) 

-112. 54 (-113. 91) 

- 1 1 2 . 54  ( - 1 13 . 91 ) 

56. 56(56.57) 

Groups  of  length 

h=100 

- 56 . 92 ( - 61 . 93) 

20 . 55(22.45) 

263.66(278.87) 

-116. 89  ( - 1 2 0 . 89) 

- 1 16 . 89 ( - 1 20 . 89) 

59. 58(60.04) 

Groups  of  length 

h = 20 0 

-63. 6 5 (-76. 23) 

21.94(27.63) 

307.61 (343.23) 

- 39. 4 0 ( - 148. 79) 

- 139. 40(-148. 79) 

73.75(73.89) 
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Table  9:  Covariance  matrix  of  the  ungroupcd  and  the  mid- 
point estimates  of  g and  the  value  of 
(x'x)  'TQ2  U + h2/ (12  02))  in  parentheses  for 
comparison-model  (ii),  ng=100,  o = 100 


§1 


h 

4.02 

0 2 

-11.94 

A 

33 

3.67 

A 

01 

4.04(4.02) 

A 

32 

-12.00(-11.95) 

33 

3.70(3.67) 

Si 

4.10(4.10) 

-12. 20 ( - 1 2 . 19) 

33 

3.83(3.75) 

h 

4.28(4.35) 

32 

-12. 55  C- 12 . 93) 

33 

3.99(3.98) 

A 

01 

4,79(5.36) 

A 

32 

-14.14(-15.92) 

33 

4.47(4.89) 

3 2 

3 3 

Ungrouped  data  -- 

-11.94 

3.67 

58.  OS 

-24.35 

-24.35 

12.18 

Groups  of  length  h 

= 10  

-12.00C-11.95) 

3.70(3.67) 

58. 38(58.10) 

-24.50(-24.37) 

-24. 50  (-24 . 37) 

12.24(12.19) 

Groups  of  length  h 

= 50 

-12.  20  (-12. 19) 

3.83(3.75) 

58.25(59.26) 

-24.42(-24.86) 

-24.42(-24.86) 

12.15(12.43) 

Groups  of  length  h 

= 100  - 

-12.55C-12.93) 

3.99(3.98) 

61.15(62.89) 

-26. 21  (-  26. 38) 

-26. 21 (-26. 38) 

13.24(13.20) 

Groups  of  length 

h=  200  

-14.14(-15.92) 

4.47(4.89) 

69.39(77.40) 

-29. 61 (-32. 47) 

- 29.61  (-  32. 47) 

15.18(16.24) 

IT 


and  fully  history  of  hypertension,  blood  and  sugar  levels  of  urine 
were  also  recorded.  We  study  the  dependence  of  systolic  and  diastolic 
blood  pressure  levels  on  age  and  sex,  only  for  the  sake  of  illustration. 
The  paucity  of  observations  on  other  variables  does  not  allow  extensive 
analysis  of  the  data.  Persons  between  ages  of  18  and  28  were  involved 
in  the  study  and  there  were  about  4,000  observations. 

The  bl"od  pressure  levels  were  grouped  into  two  different  intervals-- 
intervals  of  size  5 nm  Hg  and  10  mm  Hg.  Since  digit  preference  in  record- 
ing and  reading  blood  pressure  measuring  instruments  is  well  documented, 
see  for  reference  Bcrhani  et.  al.  (1969),  in  view  of  this,  the  size  of 
interval  equal  to  10  nm  Hg  seems  quite  reasonable  and  the  error  will  be 
eliminated  if  we  choose  the  size  equal  to  10.  Table  10  provides  the 
estimates  with  the  standard  errors  for  the  parameters  of  the  model  for 
systolic  and  diastolic  pressure  given  by  the  equations. 

Systolic  Pressure  * + B^Age)  + S2(Sex)  + error 

Diastolic  Pressure  * YQ  + Y^Age)  + Yg(Sex)  + error 
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Table  10:  Estimates  of  the  Parameters  (Estimate  + Standard  Error) 


Parameter 

Systolic  Pressure 

h - 0 

h - 5 

h * 10 

111.71  + 1.54 

112.72  + 1.55 

113.77  + 1.58 

^1 

0.05  + 0.07 

0.04  +0.07 

0.01  + 0.08 

P2 

11.5k  t °*39 

11.61  + 0.39 

11.69  + 0.40 

Diastolic  Pressure 

h » 0 

h - 5 

h - 10 

V0 

62.71  + 2.27 

63.65  + 2.28 

64.95  + 2.31 

V1 

0.U7  + 0.11 

0.47  + 0.11 

0.43  + 0.11 

_ ■ 

4.84  i 0.57 

4.88  i 0.57 

4.09  ± 0.58 
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